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ABSTRACT 


This thesis is concerned with the design and 
development of Prettyprinters - an increasingly 
familiar ccn^onent of any programming environment '• 
Throughout this work^ the emphasis iis oa fofmalisation 
aimed at minimising , the systems prograimiing effort 
involved in developing such packages • To this end, 
a step has been taken towards developing a pretty- 
printer generator for LL{1) languages'; Two pretty- 
printers for PASCAL, catering to two different 
prettyprinting conventions, have been designed and 
irtploment ed '• 
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INTRCDUCTION 

1-1, PROGRTimiNG ENVIRONMENTS (PEs) 

The primary purpose of a programming Language (PL) 
is to provide a vehicle for expressing an algorithm as 
a program'.' PEs facilitate and encourage the use of a 
PL by providing the user a conducive environment to 
develop, troubleshoot and run his program, through 
nimerous aids and supports'.' The scope cf the term is 
best exemplified by a description of its constituent 
packages'* These nay be classified under two heads - 
Static and Dynamic'^' 

. 1-1 '.'1 PEs Concerned with the Static Aspects of a 
Program: 

Typical packages in this category are: 

(i) Prettyprinters : They format the text of 
program to present a pleasing display of 
the text, highlighting its logical and syn- 
tactic storucture'. 

(ii) Cross-reference (CREF) gerarators; Provide 
a cross reference listing of the definition 
and corresponding uses of all variables and 
procedures/ functions in the program and the 
static nesting of blocks within the program'^' 


1 



2 


(iii) parsers and Context Sei^Ltive Amlysers: 

The former parsers an input program according 
to the context free specifications of the PL, 
while the latter caters to the context sensi- 
tive issues of the PL like definition bef ore- 
use^ scope identifiers and type compatibility V 

(iv) Editors: Provide the user with an inter-active 
environment^ tailored to suit his PL/ to enter 
his program text'.' iua editor normally provides 
these facilities by incori^orating a parser and 
possibly/ a prettyprinter to support its routines'^' 

(v) S our ce-to -Source Language Transformers: These 
xise a set of standard/ rteaning preserving 
transformations to produce logically equivalent 
versions of the input source language text/ e.g./ 
replacing an end-recursion by a ‘go to* and 
other recursions through explicit stack/ ete'i' 

(vi) Static Program Profiles: These provide a 

quantitative measure of the usage of various 
PL structures',' For variables/, it gives the 
nuinber of times it is referred in an expression or 
a var-access';' For procedures/ftontions/ it counts 
the nimber of calling instances', i^sideS/ 
statistics with regard to nesting and use 
various data and control structuring f aeSLities 
^ also reported'/ 
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1—1 PEs ConcernGd with the Dynamic Aspects of a 
Program: 

Typical packages of this class are: 

(i) Run-time diagnostics like memory dumps and 
stack analysis: 

If a program encounters a fatal error during run 
time^ the fojaner enables the user to examine a sr&pj^ot 
of the states of all variables at the time of crash, 
while the latter provides a backward trace of procedure/ 
funfction calls from the point of error to the main program'* 

(ii) Interactive Debugging Facilities: 

These enable the user to interrupt program execu- 
tion whenever control passes through any point speci- 
fied by him in the source code'.' He can then examine the 
states of all variables accessible at that point, assign 
new values .to variables, if necessary, obtain a trace 
of the flow of control upto the break point and other 
such facilities;’ 

X-2'*RELATien BETWEEN PEs AND PL STRUCTURE 

While the definition of a PL ^es take into account 
factors affecting its possible implemantation and use, 
the actual environiront for its usage is provided 
the elenents constituting its PE'. PEs, therefore, deal 
with live issxjes concerning actuaj use of a pl';: Any 
pjackage in a PE ought to be modularized as a PL-dependent 
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part, an application dependant part and an Operating 
System (OS)/machine architecture deperdent partV This 
modularisation would emble the automatic generation 
of at least the PL-dependent j^rff m this thesis, 
we conciously seek such a delineation for Prettyprinters'f 
1~3, STRUCTURE OP THE TIES IS 

The main body of the thesis is aoncemedwith pretty- 
printing - a familiar issue concerning one of the static 
aspects of a pl. Chapter 2 introduces the subject and 
c rrii„s out a critical appraisal and classification cf 
various prettyprinting algorithms published so far;- Based 
on this assessment, it selects an algorithm to be imple- 
mented for prettyprinting blockstructured / LIS> -like 
PLs, The third chapter briefly describes the algorithm 
to be implemented and the refinements or modifications 
made in the origiml algorithm'.' A set of prettyprinting 
standards is also defined fear PAS0-\L;- chapter 4 derives 
an extended prettyprinting granmar for PASCAL and MOIXJLA-2 
and describes the actual implementation of a prettyp: inteiy 
for PASC-\l;- Possibilities of automating prettyprinter 
generation are explored in the fifth chapter and a rough 
scheme for the same is proposed'. We conclude the efforts 
of our exercise in the last chapter, touching briefly 
upon the possibilities of adopting a similar approach 
in the design of other pe elements like interactive 
debugging systems"; 



CH;iPTER 2 


'■-HE CURRENT STx^iTE OF ART 

2-1, RE^iD/^iBILITY OP PROGRTiMS 

A program is considered to be readable if a progra- 
mmer is able t o read and understand if* Initially^ it 
was believed that programs would have very short life- 
times beyond their period of developnent and would be 
res^d only bv.y compilers'. This view obviously does not 
hold now^ for we know that a program can have a very 
long lifetime A typical utility program may be siibject 
to maintenance, changes and in^rovements, possibly to 
meet changes in specifications and/or environments. 

For such programs, readability is of utmost iit^crtancQ • 

In fact, readability is essential for developing large 
programs tooV This is especially true in case of pro— 
grairening teams, where close cooperation and good coirenu- 
nication are a must". 

Several facets of programming style affect 
readability',' Important amongst these are structured and 
modular program develop:nent , proper choice of a progra- 
itining language (PL), the choice of semantically relevant 
names, caranenting and finally, formatting',' The design 
of a PL also effects readability Some of i the clarity 
enhancement techniques used in language design are mirror— 
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image closing keywor<lS/ exterKled closing keywords (e*g'.V 
if cndif, for endfor etc,), prefix style inter-' 

mediate keywords (e'.'g'", SLSIF in Module ii) and others', 
our interest in this chapter lies in the formatting aspect, 
2-2 an IInTTRODUCTION TO PRETTYPRINTING 

2-2,1 Definition of the terms 

The term, coined by Ledgard [ledG- 75], was defined by 
him as *' prettyprinting is the spacing of a program to 
illuminate its logical structure" ,' The definition can be 
extended to include the illiomination of the syntactic 
structure as well, as is evident in the consistent aling— 
ment of syntactic constructs in blockstructured languages 
and other languages like LISP', Any conflict between the 
two requirements above implies either or both of the 
following: 

(i) The program needs restructuring, 

(ii) The choice of an unsuitable PL has resulted 

in an unnatural twisting of the program struc- 
ture to conform to the limitations imposed by 
the gjanguage .' 

2-=:^'2V2 Importance of Prettyprinting: • 

The objectives of prettyprinting, outlined above, 
xonderscore its Importance, In LISP lile dhnguages , where 
the main delimiters are parenthesis or spaces, a program 
or an S-expression is visually intolerable unless pretty- 
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printed';' so prettyprinters are conmon components of a 
LISP environment'* An exati^le is the LISP editor in the 
UCI LISP system, which incorporates a prettyprinter V 

Editors for any blocks tructured language, in fact, would 

from a prettyprinter. With the proliferation of PEs 
benefit enormousl;^for bloclcstructnred languages, auto- 
mated prettyprinting premises to become an inseperable 
part of such systems',' 

2— 2','3 Automating the Prettyprinting Process: 

A program which takes in an input of a stream c£ 
characters representing a program in the specified language 
and outputs a prettyprinted version of the input is called 
a prettyprinter'* 

2-2'*'3*1 Motivation for writing prettyprinters: 

(i) They relieve the user of the tedium of 
painstakingly formatting his program'* 

(ii) They help in ensuring a uniformity in 

prettyprinting style, i^hich is so essential 
to maintain good ccmmunication and efficiency 
amongst team members of a large softv/are 
project 

(iii) Availability of a prettyprinter allows the 

user to Store compacted versions of his progremi, 
with all spiorious blanks squeezed out'; The 
prettyprinter c^ then be used to generate a 
prettyprinted verslicm, whenever necessary ;■ 

This may save possibly valuable storage space'.' 
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(iv) They help novice prograrraners develop 
(cultivate?) a better appreciation of 
of the logical structure of the language, 
which comes out vividly in the prettyprinted 
program'.' They also help in detecting such 
common mistakes as a missing "end" in an 
ALGOL or PASC^KL program'^' 

2-2';'3','2 Need to Formalize Prettyprinting Rules 
for any Language: The inport ance of this aspect cannot 
be underestirreited.. Firstly, rules define a standard, and 
an accepted starKiard goes a long way in enhancing program 
readability';' Moreover, formalization enables us to 
describe the prettyprinting process as an algorithm, 
thereby making prettypr inters realizable';' 

2^'3.. APPR^.ISAL of EXxSTING PRETTYPRINTER algorithms 
I t is desirable that the preutyprirt er handle any 
input presented to it, irrespective of the typing conven- 
tions ertployed in the source code'.' The only reasonable 
assimption that can be made is that the inpit does not 
hav© 3ny gross syntactic blunders,' The context sensi- 
tive aspect of the syntax, like "definition before use" 
and type compatibility can, of course, be ignored, since 
they do not infliKsnce our process';' The worst case input 
to a prettyprinter may then be a CCTnpacted regresentaticn 
of a program with multiple blanks and other syntactically 
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redundant, layout characters squeezed oufi' To this end^ 
the idea of emiDloying the language syntax in scane manner^ 
to control the prettyprinting process autcanatically^ 
presents itself 7 A prettypr inter takes tv7o major dis- 
cussions at any stage of the formatting process - insert- 
ing linebroaks in the output and determining the inden- 
tation after every linebreakV The existing algorithms 
for prettyprinting can be classified roughly/ into six 
major categories as described belowT 

2-3,1 The first category consists of algorithms 
that are pxxrely concerned with rebuilding source files 
from compacted files’. Their job is to siirply output 
the source text with each statement on a separate line’,' 
Such schemes are satisfactory only for languages having 
a very strong statement structure (e’.’g,/ HDRTit'iN)’;' They 
fail miserably in LISP like languages or blockstructured 
langua-ges’l 

2— 3V2 The second class of programs use a fixed eet 
of key words such as BEGIN/ THEN and ELSE to trigger 
line breaks and/or indentationV This warrants only a 
minimal domputational overhead over scroa form of lexical 
analysis of the input text'! Such routines are /however/ 
rather sensitive to the style in \~hich the input is 
written! The output may, therefore/ tend to fluctuate 
between generating very long lines and sparse> zigzag 
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displays of nested blocKS'i' The former imrtKsdiately runs 
into trouble because of a finite width output medium 
and possibly, due to a compiler that accepts only fixed 
length input records';' The latter, besides wasting paper, 
might spread the program over too many lines to maintain 
any clarity, thus defeating the very purrpose of using a 
prettypr inter',' 

2-3 ','3 We now come to the third scheme, vSnich attempts 
to negotiate some compranise betvjecn the tv/o described 
above',' P«hile using a snell set of keywords to force in- 
dentation, it assess an aj^roximate ne asure of the pro- 
gramfe structirred complexity to handle the rest of the input; 
Ledgard*s [iH 77] and Jackel’s [jACK 80] algorithms are 
representatives of this class',' The action of. suehEonratters 
is shown for a section of LISP code below: 

(i) (function argument 1, argument 2 arguii^nt 3) 

(ii) (function argxament 1 
argument 2 
argun^nt 3 ) 

For a large available space, the resulting output is (i), 
otherwise it is (ii)';' Evidently, the most cnritical decision 
in this process is to determine whether the next sublist 
is ’’small" enough to fit the current line'; If this 
decision making process is not sophisticated enough, there 
is a tendency to occasionally outpxJt lines that are lunger 
than desirable'; This aspect, and a tendency to get baffled 
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by obscure cortiblnations of control structures are the two 
m^ln disadvantages of the Ledgard's algor ithmV Jackel 
uses an abbreviated form of pascal syntax which is 
suppo.^ed to highlight the prettyprinting aspects while 
suppressing “irrelevant" syntactic details;’ The gjarser 
for this syntax fills up an in^out buffer with the program 
text, interspersing linebreaks where required’,’ Thep.a 3 fse«' 
implements any possible break points encoiontered after 
filling up half the output buffer’; The dangers of the 
output marching off the ri^t margin, therefore, remain’,’ 
Nevertheless the idea of using a parser of the language 
to drive the prettypr inter may be used with advantage , 

2-3.’«* The fourth class of prettyprinters [GOLD 73]^ 
which make a prepass over the program tree , avoid all the 
problems mentioned above’,’ The prepass measaures the pre- 
cise size of subtrees in the printed form as a iteans of 
determining wtere to break lines’,’ The print pass which 
follows uses these sizes in printing the program’; Evi- 
dently, the cost of making two passes, and the space 
needed to store the sizes (or time required to recalculate 
them in the sec»nd pass) is high’; The advantages are extre 
me flexibility and an ability to tackle awkward nestings',’ 
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2“'3,5 Algorithms used for applications lite type- 
setting technical text for publication or displaying 
mathematical formulas etc'.', may be grouped under the 
fifth class of prettypr inters , The picture implcman- 
tatiohs Tcpresentative of this class are TEX [knut 78] 
and the picture ccmpiler techniques [I'dART 67 ]v The 
latter essentially builds a data structure to represot 
the entire program in its prettyprinted foaan before 
making a pass over this data structure to output it •' 

2-3 TS The discussion on the third category allud- 
ed to the use of a parser of the language to control 
the process',’ Any apprehensions orB may have about the 
cost of this idea may be dispelled by thB fact that 
most of the time shall be spent in lexical analysis and 
inpat— output/ since the formatting process needs only 
the context f;ee information. The minimal overhead 
incurred in using such a parser is more than offset by 
the benefits derived fran it in program develaganenf. 

The sixth and final class of algorithms use a parser 
of the language to drive the prettypr inter / which 
conceptually consists of tw-o parallel process<os,' The 
algorithms enveloped Icy Oppen [CPPE 80 ]/ Hear>n ^nd 
Norman [hn 79 ] and Jim Mcarris are representatives a£ 
this class. In absence of any literature on the last 
algorithm/ we shall briefly discuss the first two';' 
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Og>pen*s algorithm consists of two processes SCAN 
and PRINT'.' SCAN accepts a stream -of TOKENS, Tnhich re- 
present the input progrcim and its prettyprinting require- 
ments V h TCKEN is either a character string, or a ‘^e- 
limiter*t' The latter, which provides indentation infor- 
mation, is evaluated by SCAN when it fills up a buffer 
with these TOKENS;' When the total length of the un- 
processed TOKENS ( = E string 's'isos + 2 Breah-Blankspace) 
exceeds the width of the output nedium, SCAN repeatedly 
calls PRINT until the buffer is emptied of all character 
tokens'. If PRINT receives a character TOKEN it is 
out;^t forthwith; if it receives a “delimiter", it takes 
decisions on linebreaking/ indentation'.' Indentations for 
nested constructs are handled using a stack'. 

The two processes '• Printer 3id Foriretter - of Hearn 
and Norman* s algorithm, are analogous to Oppen's SCTiNr a-nd 
i- I-.T.'vT respectively, except that Printer has an inbuilt 
parser to generate indentation information, thereby avoid- 
ing a separate driver module'.' Printer feeds the charac- 
ter strings representing tte input progran into a FIFO 
buffer, inserting special markers containing indentation 
information, where a line may be broken. When the buffer 
is fxiLl the Formatter is activated'. This routine empties 
the buffer, breaking the line as directed by the special 
markers';' if, at the end of scanning a language construct. 
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printer finds that nane of the markers have been touched 
by Formatter, it overwrites them with blanks'” A simple 
is maintained between the two processes to 
control modification/use of a marker* 

2-4 ;■ A COMPARATIVE ASSESSI4ENT FOR SELECTION 

Having made a survey of various published algorithms, 
we turn towards the task of choosing a suitable one for 
our purposed Our inmediate interest is to develop a 
prettyprinter for P/^CAl and later, possibly, for other 
languages',' This autonatically eliminates the first, 
fourth and fifth- class — the first is too crude to handle 
blocks tructured languages Hike PASCAL, while the other two 
are too costly to employ in prettyprinting PLs', This 
leaves us with a choice between the second, third and 
sixth class V The former two, as we have already seen in 
the last section, have a tendency of occasionally gene- 
rating outputs that march off the right margin'i' The 
sixth class, with little additional overhead, manages 
to tackle the problem admirably by cleanly separating 
the processes of scanning the input text arid outputting 
the prettyprinted version';' Ihis allows linebreaking 
decisions to be delayed by an equivalent of one buffer 
length of source code',' The possibility of the output 
overflowing the right margin is thus eliminated; The 
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main forte of these algorithms^ however^ is their lan- 
guage independence Oppen*s algorithm achicives this to 
a greater extent than Hearn'* s',' The former is supplied, 
all PL-dependant information on possible line brenics 
and beginning and closing of constructs^ through the 
three special ''delimiters" /by a parser of the PLV Thus 
a clean separation of the PL depand .nt aspects of our 
problem., from its PL-independent aspects is achicv'ed 
and the interface between the tvr) is v/ell defihed<In the 
latter, the separation is not as clean since the first 
process - vizV, Printer — has to parse the input program 
itself to derive prettyprinting specifications’," Besides 
this issue, the three different delimiters of the former 
allow us to handle indentation decisions and linebreaking 
decisions independently, thus modularising the decision 
making process';' This mechanism helps us to avoid 
exception handling for cases like long identifier lists 
and the like, which is necessary in Hearn and Norman *s 
algorithm’. We, therefore choose Oppon’s algorithm for 
our implementation. 



CHAPTER 3 


IMPLEMEWTATICN of a PREITkTRINTER 
PHASE I 

Having chosen a proper algorithm for our purpose^ 
we turn towards formulating a formal sot of pretty- 
printing rules for the languages under consideration - 
viz'.v PiSCAL and MUDUiA-ii;' 

3-1, CHOOSING A SET CF PRETTYPRINTING RULES 

The proces s of prettyprinting is inherently sub- 
jective^ since it depends on the individual programmer's 
taste. This, in fact, holds for all readability cri- 
teria, There is, therefore, no universally accept 31e 
criterion for prettyprinting';' The enianeration of a 
set of prettyprinting rules depends on the programmer *s 
own tates and ejjperience',' For automation purposes, these 
rules must be stated un ambiguous ly'.' The rules put forth 
by HQiaras [hls 77], Peterson [pETE 77], Crider [cRID 78], 
GROGON.O [grog 79] and Mohiiner [mchi 78] were considered 
before formulating our specifications'; 

3—1,1 Of the above methods that have been published, 
Crider's method differs radically from the rest',' The 
general structure of his format is: 

introductory phrase 
dependoit clause 
depend-Qit clause 

' 

m . . 

dependent clause 
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.Thus typical control statements would take the 
forms I 


if conditions then begin 

statement 

sequence 

end 


repeat 

statement 

sequence 

until condition 


else if condition 2 then begin 
statarient 
sequence 

end 

else begin 

statement 

sequence 

end 


while condition do begin 


statement 
sequenc e 

end 


etc 


'Ihe symbols "begin" and "gnd" beccmie redundant in 
such a scheme and are put at the end of source lines;' 
The only exceptions are the begin and end of the state- 
ment part of a procedure/function/progrann , which are 
aligned'.' By treating these differently, we are able to 
emphasize the basic difference between the statement 
parrt and the compound statement, which is not evident 
in the conventional methods'.' in the conventional rules. 
Multiple indentations occur when a ccmpoimd statement 
follws an. IF, ELSE, MIILE and other clauses;' These 
indentations do not illianinate the logical structure 
in any way',' This is avoided in this technique. Indent- 
ing the UNTIL alongwith body of a SEPEi'i.T: emphasizes the 
fact that it forms a part of the repe at statement, and 
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is oxccutad as many titnas as the repeat statenent 
iterates, ' However, in Crider's scheme, it is nece- 
ssary to introduce redudant BEGIN V,'* END gairs and 
semicolons to maintain consistency ■«' The “indented end 
relationship" that he discusses, relates the indentation 
at any line containing an UNTIL or END to the following 
line, which during the process of coding, is non- 
existent V These considerations and the experience of 
usJag conventional layout, go against inclement ing 
this scheme for the user erv ironment here • 

3-l';'2 statement of Prettyprinting Rules for PLSG^L; 

The following are the rules followed by our pretty- 
printer: 

(1) The keyword PROOIAM is written in the left 
itarginV 

(2) The entire program/procedure/ fianct ion block 
is indented with respect to its heading* 

Thus the skeletal structure of a typical 
program looks like: 

program gRCXSNiME (parameters ) ; 

label declaration part; 
const declaration part; 
type declaration part; 
var declaration part; 

Proc/func declaration; 

Proc/func declaration; 


Proc/func declaration; 
StatOBQit part; 
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(3) Declarations are indented with respect to 
introduct03rY keywords^ viz.V label s 
const ^ type and var (Declindent) 

(4) Jkiy statement starting on a fresh line is 
started with an indentation corresponding 
to the level of nesting of control of that 
statement. The feasibility of starting every 
simple statement on a fresh line is questionable^ 
since it tends to produce long^ sparse lists 
which need more indentation to emphasize their 
nesting level,' This is especially true for 
assignment and proc/ftanc calls which a are much 
shorter than structured control statements'," 

So mtiltiple statements are allowed to be output 
on a single line, 

(5) Evcary statement/ declaration shall be written 
out without a line break unless its length 
exceeds the space available between the right 
ms^rgin and the current level of indentation 
or the standard to print it specifies 
othearwise'i' 
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(6) The statement sequence of eveiry ccmpound 
statement is indented Beginindent spaces with 
respect to its opening and closing teywords^ 
viz*# begin and end '*' The end of every ccmpoimd 
statement is aligned vertically below the begin . 

(7) All statements, except the compound statement 
and the case statement shall be >7ritten out on 
a single line, as far as possible*' 

In case a structured statement cannot fit on a single 
line it shall bo written with the a ction part indented 
with respect to controlling phrase;' In each of these 
statements, the keywords vhich signify the action- ^part 
( then , else, do ) are placed alonyith the action part'*' 

( 8 ) , ^ statement : 

if <Boolean expression> if <Boolean exp> 

then ' 

<S>^ then 

else OR <S> 

<S‘> else 

<S‘> 

The indentations for the then and else clauses 
are forced only if iiecessary';' 

(9) The whil e statement takes the form: 

wa.5J.e <Boolean expressian> 
d ■ <statement> 
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(10) For rhe repeat statement, we have the until 

aligned below the repeat with the statement 
sequence, which forms the body, indented 
with respect to the opening keyword and the 
controlling "until phrase” 

Thus , 

repeat 

statement 

sequence 

until <Boolean expression> 

(11) The FOR statement is written out as 

FOR <C0NT VAR> j= <init val> (T0/D0WNT0)<f Inal val> 
DO <statement> 

The controlling phrase, if needed ray be written 
as: 

FOR <control var name> := 

<init val exp> (TO/DOWN TO) <final val e«q£> 
or as : 

FOR <control var nante> := 

<init val exp> (TO/DOWNTO) 

<final val 

(12) The prettyprinted CASE statement takes the form 

CASE <case inde 2 {> OF 

, <case e3Bment> 

<case elenient> ' 

.. m . ■ ■ 

<case e3eiBant> 


END 
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where <case element> is v/ritten as 
<Gase constant l±st> : <statement> 
or as 

<case const list> : 

<ststerTient> 

depending on the space available*. 

(13) The last struct lared statement - wriH - is pretty- 
printed ass 

WITH <record var list> 

DC <statement> 

(14) va.R declaration; l£ the identifier list of this 
declaration is long# it is written as ; 

<identif ier list> ; 

<Type denoter> 

(15) Formatting long ARRAY declarations ■v\&iich' cannot 
fit in a line* 

<identif ie3:> « ARRAY [<index type list>] OP 

<Type denoter> 

(16) Layout of records; The body of the record is inden- 
ted with respect to the introductory and closing 
}<;eywords ass- 



RECORD 


<record sectioii> 

<record section> 

♦ 

<reGord secticn> 

<variant part> 

END 

Where <reGord section> takes the form 
<identifier Xist>' : <Type denoter>7 
or as 

<identifier list> s 
<Type denoter>7 

<variant part> is formatted as 

CASE identifier : identifier OF 

<variant> 

<variant>» 

<variant> 

where <variant> is 

<Gase const list> i <field list> 
or 

<case const list> : 


<f ield list> i 
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Prettyprinting long lists: Whenever the list 
threatens to overflow, a lareak is inserted at 
the nearest delimiter separating the list ele- 
ments',' The delimiter may he a semicolon or a 
commal Thus : 

(i) list element 1, list element 2, list element 3, 
list element 4, list element 5, list element 6, 
list element 7 

(ii) (<formal paraspeo ; <fo 2 anal para speO; 

<fonr^l para 

(18) Handling long ejcpressions : The syntax of expressicns 
provides us with a natxaral hierarchy of breaks which 
may be forced wherever necessary. Thus 
<expressicn> may be output as 
<simple exp> RELCP 
<siinple exj£> 
where <sini 5 )le e 2 {p> is 
<term> ADDCP 
<term> 

where <teim>^ is 

<facto20> frRJIiOP 


<fac*,or> 
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(19) Other cases: Proximity to the right margin or 
long variable accesses may warrant a break at 
the assignment operator of an assignment 
statement; if this too fails to prevent an 
overflow over the right margin^ then breaks 
may be inserted at record qualifiers (.) or 
array index accesses; 

3-1;3 The Positioning of Semicolons: 

The role of a semicolon is context sensitive,' Between 
various parts of a program or PROCEDURE block, it acts as 
a seperator; The same holds for the semicolon between a 
PROCEDURE/ FUNCTION heading and its body'* In such cases the 
semicolon may be positioned inarediately after the part 
which it ends; 

In all other cases, the semicolcm can be given a sCTPn- 
tic significance rather than looking upm it as a nere se- 
parator betwe^ two constructs; Within a CONST, TYPE or VAR 
declaration part or a formal parameter list, the SCTiicolcai 
may be viewed as a continuation symbol, indicating that 
another parameter or declaration or a section of a declara- 
tion is to follow; Within the statement pairt, the semicolcai 
acts as a continuation marker, heralding another acticai 
(statement) at the same level at which it was encountered',' 
Similar connotations n^y be ascribed to commas in the 
program text'; To otiphasize this property, the semicolcai or 
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comma must be placed in the same line as the construct 
v/hich it heralds « The same reasoning may be carried 
further on to include operators when line-breaks are 
inserted in an express ion'o 
3-2V TPffi hLGORITHJ 

We have already selected Oppen*s algorithm for our 
purpose. An excellent discussion of its developfnent 
through stepwise refinement is available in his paper 
[OPPE so]. We present a brief outline of the algo- 
rithm followed by the modifications made and its inter- 
face with the p_arser-driver module,' 

3-2 VI A Brief Outline: 

The driver module feeds a STREAM of generated TOKENS 
to t .,ie prettyprinter modileV These TOKENS completely 
specify the pr-'gram text and its prettyprint.ing require- 
mentsV A is either a string^ or one of the three 

delimiters - BREAK^ CONBEGIN (CONstruct BEGIN marker) and 
CONEND (CONstruct END ne.rker)V A STREAI4 is defined as; 

(i) A character string is a STREAM 
(ii) If are STREAM, then the following 

are also valid STREAMS; 

(a) CONBEGIN S, BREAK S VVV' BREAK S_ VV. CONEND 

(b) CONBEGIN CCIIBEGIN BREAK CONEND CCMEaJiD 

(c) CONBEGIN CONBEGIN BREAK CONEND S^CCNEND 

(d) CONBEGIN CCNBEX3IN BRE/^ CONEND CONEND 



27 


A CONBEGIN has three attributes assodated with it: 

(i) Size: The length of the construct it begins + the 
length of the construct, if any, which follows it 
before a '.BRE^iKis encountered'. 

(ii) Offset: The base indentation of any linebreah 
within this construct',' 

(iii) Break Type: The nature of all BREAl© within the 
construct, viz';', CCUSISTENT or INCONSISIENT';' 

In the former case either all or none of the 
BREAKS force a fresh line in the output, whereas 
in the latter, a BRE-AK forces a fresh line only 
if necessary'; 

A conend merely indicates the closing of the construct 
corresponding to the nearest CO^EGIN that precedes it'; 

The size attribute correspcaiding to it is, tharefore, 
irrelevant'.' 

A BREAK also has three attributes associated with it: 
Size: The length of the stream lying between this 
BREAK and tl^ next one at the same level'. 

(ii) Offset: In case the BREAK forces a fresh line, the 
next line is ii^ented *a£fset* spaces over and 
above the base Indentation, with respect to the 
beginning of the construct'; 

(iii) Blankspace: If the BREAK does not force a new line, 
' then "Blankspace** blanks are forced in the output'.' 
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Every string has a size attribute asscxniated with it^ 
which gives the number of characters in it« 

Barring the size attribute^, all other attribute values 
and the positioning of the "delimiters'* themselves is 
governed entirely by the input PL-syntaxV These require- 
ments are met by the parser, which assarf^les . and 
pasaestokens to the prettyprinter module via calls to same 
constant taken generating routines',' The size attribute 
of strings can be ascertained at this point w-ithout any 
difficulty.' 

The routine SCANNER of the prettyprinter module is 
called every time the driver generates a "delimiter" TOKEN 
or validates a string TOKEN. SCANNER deposits these 
TOKENS into a FIFO buffer and ccsraputes the size attribute 
associated with each "delimiter" 'i' If the sum of the 
sizes of all u,iConsumed TOKENS in the buffer exceeds the 
maximum available space, or if an end-cf -input-file 
is encountered, SCiMNER repeatedly calls PRINTER until a 
"delimiter" is encountered whose size has not been evaluated 
or until the buffer is en^tied";' In each execution, PRINTER 
consvimes the next unccaisumed token in the buffer and 
outputs a linebreak, an ind^tation of a string. The choice 
between a linebreak and a plain indentation, as also the 
exafct offset with respect to the left margin are coiqputed 
using bbe attributes of the **<felnniters**. 
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3-2.2 Modixications to Oppen's Algorithm : 

The scanner routine of our implementation remains 
essentially the same as Oppen*s^ vath no major conceptual 
modification'.' The modifications in the PRINTER routine 
are discussed below';' 

During the formatting process, there are many 
instances where a construct has to be started on a 
fresh line or after which a line break has to be forced'. 

In the original algorithm, this is accomplished by insert- 
ing a BREAK with an associated "Blankspace” equal to the 
width of the output medium;' Now a PL may have multiple 
constructs of the former class which are nested or a 
construct of the former class immediately following one 
of he latter',' These occurences will generate multiple 
line breaks in the output, giving it a patchy appearance'; 

To avoid the above anoraoly, a check has to be k^t on 
whether the current line is empty, and a line-break forced 
only if it is not so',' This nay be done in two wayss 

(i) If the line is en^ty, output a carriage retrum and 

indent according to the new offset, Ihis irethod was 
rejected because of the restriction presented by the 
text editor on the DSC-10 vbich converts all carriage- 
returre to line feeds* 


f- 
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(ii) On starting a fresh line, defer Indentation until 
the rirst non—blarik character ccroes in'« Store the 
indentation in a variable, LE:iDBL.’'iNKS and update 
SPACE, which stores the space left on the current 
line as if the indentation has already been 
effected'.' if a line break is forced when the 
current line is still empty, sixnply readjust the 
values of SPACE and LEADBIANKS." 

Another case to be tackled is when a new line is to be 
forced only if the currently available space is less than 
the Space that would be available after forcing the new 
line*" This is easily done by making this check before 
forcing a new line';' 

With the above modif cations, it would not be possible to 
force 

/biar.k lines into the output';' This facility is needed for 
inserting blank lines betwaen procedure blocks'; The drawback 
is overcome by adding a new significance to tfe value of 
'hlankspace'^ which does not serve any role in deciding the 
offset for the fresh line'; If the value of blank^ace is 
twice the width of the output nedium, a rBw line is forced 
unconditionally, otherwise a cteck is made to decide whether 
a fresh line is to be started'; 

In large programs, it is preferable to prevent a pro- 
cedure/ function body from overflowing a printed page as far 
as possible;- For this^ suitable printer control characters 
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can be inserted to skip pages (Form Feed) at suitable 
points. This may be left either to the user’s .discrer- 
tion or attempted by the pettyprinter’,' If the user 
chooses the former^ the prettyprinter retains the 
"page marks" put by the user in the input file’. In 
the latter case, the prettyprirfer uses some heuristics 
in inserting page nu rks after the end of seme procedure 
bodies Obviously, page marks cannot be inserted after 
the statement part of every procedure lest the output 
get too sparse',' Also, the prettyprinter does not know 
the length of the procediare that would follow a possible 
page-break, for that would need a buffer size of the 
order of one printed page I So the prettyprinter singly 
looks out for an end-of -procedure-block after half tl^ 
page has been filled, and inserts a pagemark there',: 

Regardless of the width of the output mediwn, any 
printed line should be small enough to be sized up in an 
"eyeful"'* So the prettyprinter restricts the width of 
the nonblank portion of any output line to a maximum of 
”l$axef£space" characters'^' To ensure this, suitable 
changes have been made in the Printer routine of the 
prettyprinter';' 




C* CONSTANT AND TTPB DECLARATIONS ♦) 


const 

t* SCANNER CONSTANTS ») 

' ;SIZEINFINITY8777777B 

? LINESIZE=125 (♦UNENIDTH+i ♦) 

; UPTPAGE=248? FORCEPAGEsASS 
' CONSTANTS *) 

PRISTKDEPTh=l00;PRISTKaoUNcJa99r 

t,ypf!' ' ' ' 

’(* SCANNER TYPE DECLARATIONS 
LXNEsgacjed array Cl ,, LINESIZEJ of CHAR 

= C STR r NG^OPT^NBREAKr BEGIHCONSTruct , ENDCONSTROC t 

; BREAKS* (Consistent, INCONSISTENT) 

; token srecord 

case TYPEINFQ : TOKENTYPE of 
STRING ? CSTRINGVALU S LINE 
£ LENGTH I INTEGER) 

J OPTLNBREAK 

; (BLANKSPACE : 0 •, FORCEPAGE 
fBRKOFFSET : 0 LINEWIDTH) 
y BEGINCONSfRUCT 

. ^OFFSET ; 9 20 

SBREAKTYPB I BREAKS) „ , 

? ENDCONSTRUCT^EOFHARKER : (DUHBCOMP S BOOLEAN) 

end 

; (* PRINTER TYPE DECLARATIONS ♦; 

PRXSTKBREaKsCCOMPULSORYfOPTIONALfFITS^ 

; PRISTKENTRY 


# 
■ t 


:recQrd 

CURQFFSET S 0 
t EPFBREAKYAltt 

end 

PRISTKUBJect 

’**' IT C Q T* d 

INDEX.LENGIHS INTEGER 
; ITERS : PRlSfKENfry 
end ■' 

LINELENGTHaINTEGER 
C* END OF TYPE DECL *> 


LINEWIDTH 

PRISTKBREafc 



VARIABLE DECLARATIONS 


scanner 


DECLARATIONS*) 


BE(;,ININDE|1^^ IFINOENT^CASEINDENT^STARTINDEBT 

^^^A^FFSPACE# MAXEFFSPACE , LEFT 

rnK^IrPI*'!? * array CO ,, STREAMSIZEJ of TOKEN 
Iriwc^P^ : array to ,, STrEAMSIZEJ of INTEGER 
SCANSTK s array co •« streamsizeT of imteoer 
STR lNGTUKEn,DELlMTOKE^ ? TO® IMit-OER 

SCANSTKEMPtf,PAG£FLA|,OLDPAP:STHTD|CLFLAG 

LCRrr I 

iNPPii i KiSiR'"'"'' “ =“‘'' 

pC|g^^FRlNTER VAR DECLARATIONS *) 

onTc™,.,^Ar,®£r.®J'„J.O^**^P*^ISTKBOUN<iJ of PRISTKOBJect 
fgISTKTgp,PUSHENTRY,POPPe»ITEM : PRISTKOBJect 
LbADBLANKS^^PRiTOP#PRIBOTTOH,OUTHNENO 

PRlSTKEMPT|^LlNEEMPTy,PREVSTRUCT : BOOLEAN 



( ♦ 


SCAi^NEK routine 


*) 


procedure SCANhEKlsCANTUKEfti t TukenJ; 

SCANSTACK MAINTENANCE PROCEOUHES ♦) 

SCANPUSriCX ! INTEGER); 

It SCANSTKEMPtv 
else :« FALSE 

oegin 

ifToh^lmn^ , 

STACK DVEgFLOW) 

^SCANSlilTOP) ss X 
^ end; . « 

t^nction^sCANpop ! integer; 

1£ SCANSTKEMPty 

else 'attempt to POP empty SCAWSTACK') 

degin 

fCANPOP :s SCANSTKCTQPJ ; 

It TOP«BOTTQM ^ r 

then SCANSTKEMPty sa TRUE 

** (TOP-1+STREAMSIZE) mod STREAMSIZB 

endi ■ ■ ■ i 

function SCANTOP s INTEGER; 
oegln 


end 



-SCANSTK 



i^rocMurc- CHt;cKSTK(K : irftGfBj ; 

Mall"'""'’ ‘ '“'•eser; 

It^not SCANSTKEMPty 

' 0'0Q t, n ' 

SAVETOP := SCAWIOP t 

With rUKEi^SfRBAmCSCANlQp) 

'do case TypEiMFO of 

neGlWCOi^STruct t 
if K > 0 

then 
toegin 


TOKENSIZECSCA«POPJ j= 
l’OKe?^SrZECSAVETOP3 

2 


©nd ' 

ENDCOMSTRUct 

oeqin 

. 6 fi d 2 

OTHEHS : : 

oegin 

TUKENSIZetSCANPOPJ :« 


RICHlrOIAL 


end 

(* 


end 

:ase 




♦) 


end 

i* 


IF 




end,' 

C» 


CHECKSTK 


*} 


CHECKSTK(K-l) 

.’I, ■■ ■ . 



procedure APVANCERlGht; 
oegln . 

SMEAMSUE ! 

ena,-"®" 'TOKEN DUEUE OltERfOOT') 


'’""e," “'"'"CEOEFtCr : TOKEN , U ! INTEGER); 

It U >K Q 

tner> 

begin • 

■ ' PRINTER CbR-i') } 

with T 

cto case TSfPElf^FO of 

f * = beftfOTAL-fBbANKSPACE 

OrHERS • PEFTTOTAL + b ? 



end ; 

if LEFT <> RIGHT 
tnen 
begin 



_,^kg5lN£E\Eh^fffi^iisf8i,lfgif¥fJ|§Ki»SIEEaEFXI,> 

(♦ IF l.>0 *) 

ADVAWCELEFT ♦) 


runccion SCANPOPBOIto® : INTEGER; 
begin 

If SCANSTKEMPtF 
then wRiTEbNCTK, 

SCANPOPBOTtO«'?ltesfgtl§?I§S'f 

It TOP » BOTTOM "‘•'i-ouriuM; 

then SCANSTKEMPty ;ib TROE 
end;^®® eOTTOM js tBOTTOM+i) ®od S 

Degin C* CHECKSTREAM *) 

It (HICHTTOTAb - bEFTTOTAbl > EFfS 

■• ■■ . ^ 

It not SCANSTKEMPty ^ -■ 

tnen ; ■ ' *■ .-t .i-i- : 

if bEFI^* 

A0VAl8iliiffliKi;SsTH£AS£ESF|f?| 

■' end^* RIGHT thea-'||^ll| 

ena; CHECK#TftEAH ' 


'I'i'* -'h 



scanner routine *} 

with vSCANTOKEN / 

<30 Chse XXPElNf’O of 
EUKMARKER t 
b®Qin 

if not SCANSTKEMPty 
then ■' ■ / 

oegln 

CHECKSTKCO); 

rTOKENSI^E^bem) 

■4EC;i«|oSsfruct J 

ijegin 

If SCANSTKEMPty 
then '■ ■ 

beoin 

LEFTIOTAU {s 1? 

HIGHITOTAL J» U LEFT := 0?RIGHT :s 0 

end 




SlHlHG S 

begin 

if SCAMSTKEMPty 
begin 


en 4 l' 


.enb... 

(♦ 


end ; 

tMTHEPS s 


end 


AUVANCERIGht? 

•* SCANTOKEN ; 
:s LENGTH; 

RIGHITOTAL :» RIGriTTOTAb ♦ bEi<GTH 


SCA'fiMER ♦) 


CHECKSTREA® 



(♦ TOKEW GENERATING. ROU'i’iNES ■ ♦) 

procedure GEmstking? 

oegin . : 

wJtn iJTRlNGTUKEn 
■ . do, oegln 

T|P|lNFO S= STRING? 

INRUrSTRINa ? 
EENGTh S= STRiNGGENGth 

end 

end? 

procedure CENaREAKCBREAKVAGU,INOENTVALU ! 

. • WfEGER)?'.' 

oegin ' 

witn DEEIMTOKEN 
do oegin 

TrPEINFO js OPTLNBREAK? 
^^AGKSPACF ?= BHEAKVALU ? 
HHFDtfSET :s INDENTVALU 

^ f 'J Cl * ' ' 

hCAjfvtRCL'ELlMTOKENj 

end? 

procedure GENBEGlNCOnC IN0ENTVAGO S INTEGER? 

BREAKKINC ? BREAKS)? 


'.’Jif.n L'ei.imoken 
do oey I n 

BEGINCONStruot? 

(JfESET ?= iNOENTVALU | BREAK 
end I 

SCANNERtUELIMTOKEf?) 
end? 

piocedure GEwendcon? 

De<3ln , 

end? 

procedure GENEUF? 
oegin 

'DEGIMTOKEN - TYPEINFO ?«.■ ■"EQPMARKER 
SGANNERCDELIMTOKEN) 
end? 

procedure XFERRESWOBdCN ?^ INTEGiR')?'' ■ ■, 

K i INTEGER? 
oegin 

Tor K ss 1 to RWfcGTH 
do INFUTSTRINgtKJ S* BCRNT{i^^{KJ f' 
for K j» imlma + u to 'LiWEli&tH 

end? ' ' — - ■ ; , 


BEGINCONStruot? 

INOEMTVAGU | BRBAKTYPE j« BREIKKINO 


fOFMARKER? 


-'‘'A,*/* \ , li l’,/: 



P R I U f E ft ROUTINE 


procedure PBiNTERCJb ; INTEGER 

procedure 

it PRISTKEMPTy 
then PRISTKEMPT] 
else 
begin 

PRITOP J= <PRITOP+i) mod PRISfKDEPTh 
;if PRITOPsPRIBOTTOM 


ry 1= FALSE 


end 


then WRITBLNCTTY, 'PRINT STACK OVERFLOW') 


iPRIStKCPRITOPJ := PUSHENTRT 

#Fidf ■ ■ 

procedure PRISTKPOP? 
oegin 

If PRISTKEMPTy 
then 

WKITELNCTTY 

, 'ATTEMPT TO POP EMPTY PRINTSTACK') 
JPOPPEDITEM ss PRISTKtPRITOPJ 
?if PRITOpsPRIBOTTOM 

then PRISTKEMPTy ss TRUE 
• else 

PRITOP 

;= (PRITOP-l+PRlSTK0EPTh) mod PRiSTKDBPTh 

end? 

procedure GETTOPPRlStK? 
oegln 

if PRISTKEMPTy 

tnen WRITELNCtTY, 'PRISTK IS EMPTY*) 

?PRISTKTOP := PRISTKCPRITOP) 
end; 



C* SRING OUTPUT AND INDENT ROUTINES ♦) 


procedure indentc amount : integer)? 
begin 

SPACE ;s SPACE-AMOUNT 
;EFFSPACE ts EFf SPACE-AMOUNT 
?lf EINEEMPTY 

then LEADSLANKS := OEADBLANKS+ AMOUNT 
^ise WRITECOUTPUT, ' ' i AMOUNT) 

end? 

procedure outnenlinecamount s integer)? 
begin 

if not OINEEMPTY 
then 

if SPACE < MAXSPACE-AMOUNT 
then 
begin 

lilNEEMPTY ;= TRUEiWRITELN 
? SPACE :» MAXSPACE-AMOUNT 
?LEADBLANKS := AMOUNT 
? if SPACE > MAXEFFSPACE 

then EFFSPACE !« MAXEFFSPACE 
else EFFSPACE SPACE 
JOUTDINENO !» OUTOINENO +1 

?oLdpage :s true 
end 
else 
begin 

WRITECOUTPUT, ♦ • ? AMDUNT-MAXSPACE+SPACi 
?SPACE := SPACE-(A«0UNI-CMAXSPACE-SPACE- 
? EFFSPACE 

:= EFFSPACE-(AMOUNT-(MAXSPACE-SPACE)) 

end 

else 

begin 

LEADBLANKS ;= AMOUNT 
?SPACE ss MAXSPACE-AMOUNT 
?i£ SPACE > MAXEFFSPACE 

then EFFSPACE ?* MAXEFFSPACE 
else EFFSPACE SPACE 

end: 


end? 



procedure OUTNEWPAGECAMOUNT i INTEGER)? 
oegin 

if OL0PAGE 

and (((COUTblNENO rood 88) > 45) 

and not PAGErLAG) 

or {{ELEMENT , BLAMKSPACE * FORCEPAGE) 
and PAGEFLAG)) 

then 

begin 

»iRITELN?PAGE{OUTPUT)?LEA0BLANKS := AMOUNT 
yOI^DPAGE := FALSE 
; SPACE :» MAXSPACE-AMOUNT 
f if SPACE > MAXefFSPACE 
then EFFSPACE i* MAXEFFSPACE 
else EFFSPACE !« SPACE 
^ ;OUTLINENO ;= tiLINEEMPTX is TRUE 
end 
else 
begin 

WR1TELN?LEADBLANK5 :» AMOUNT 
?0UTLINEN0 sa OUTLINENO t I 
?L1N£EMPTY :* TRUE 
, 'SPACE is MAXSPACE-AMOUNT 
Jif SPACE > MAXEFFSPACE 
then EFFSPACE :* MAXEFFSPACE 
else EFFSPACE J= SPACE 

end 

end# ■ ■ ■ ■ ■ 

procedure wRiTEOUTSTringCVALUE s LINE 

;N0NBLANK5Ize S LIHELENGTH)! 
begin {♦ NRITEOUTSTRING ♦) 


if (VALUECLINEWIDTH +13 <> * ') 

or not LINEEMPTY 

then 

begin 

if LINEEMPTY 
then 
begin 

WRITEtOUTPUT,* * S LEADBLANKS) 
? LINEEMPTY S* FALSE 

end 

?WRITE{OUTPUT, VALUE J MONBLANKSIEE) 
? SPACE is SPACE^NONBLANKSIEE 
; EFFSPACE ;a EFFSPACE-MONBLANKSIEE 

end 

else 

end? 




begin (» PPINTER ♦) 

«itn ELEMENT 
do case TyPEXNFO of 
BEGINCONSTruCt 
! begin 

with PUSHENTR^ 
do bealri 

do if L > EFFSPACE 
then 
oegin 

if BREAKfyPEsCONSISTBMT 
then EFFBREAKVALU := COMPULSORY 
else EFFBREAKVALU :s OPTIONAL 
;CUROFFSET :* SPACE-OFFSET 

end 

else, 

begin 

CUROFFSET Ss OJEFFBREAKYALO :» FITS 
end ' 

'6nd ' ■ ' ' ■ ■ 

yPRISTKPUSH 

end 

? ENDCDNSTHUct : PRISTKPOP 



UP’ILNBHEAK 


Deol n 

GETTOPPRIStK 

• items 

FITS I oegtfi 

. ^INDENTCBLANKSPACE) 

©nil* 

f COMPULSORY 


EFFBREAKVAlU (ft 


f OPTIONAL 


|f CBLANKSPACE < OPTPAGE) 

OOTNENLINECMARGIII 

-PRISTKTOP , 
♦BEKOFFSED 

©is©' 

OUTNEWPAGBCHARGIN 

-PRISTKTOP , 

+ brkoffsetJ 


ITEMS , CUROFFSET 


ITEMS , CUROFFSET 


pegln 

if h > EFFSPACE 
then 
t>egln 

if BLANKSPACE f OPTPAGE 
then 

OUTNEWLINECMARGIM 

-PRISTKTOP , 

else +BRKOrPSEI) 

OUTMEMPAGECMARGIII 

- PRISTKTOP 

4- BRKOFFSET) 


ITEMS 

, CUROFFSET 


ITIMJ 
, CWR{ 


IFFSST 


end 

else 

«g|„ 


end 


? OTHERS 
end 


e’nd 


end 


NDEHTCBLANKSPACE) 


; STRING 

: begin 

WRITEQUTSTringCSXRIHGVALUtLEIjGTH) 

end 

? OTHERS : Wi 
end (♦CASE 


WRITELMCTTY^^NRONG TYPEINFO') 


TOKEN TYPE ♦> 


(♦PRINTERS) 


CHAPTER 4 


IMPLEMENTATICf? OF A PRETTYPRI1>JTER - PHASE II 

THE parser-driver for the ALGORIIHM 

Our algorithm presumes the existence of a driver^ which 
feeds the entire input program/ fortified with the *’delimi— 
te . s'* which dictate the prettyprinting requirements/ to it. 
The rules to position these delimiters in the program text 
are simple: 

(i) Identify those constaructs in the language 
which shoxild preferably occupy a single 
line or which contain line breaks whose 
offsets are defined relative to the actual 
start of the construct'.' 

(ii) Enclose all such constructs within the 
bracketing ''delimiters'* - COIBEGIN and 
CONEND'.' 

(iii) Insert the "delimiter"/ BREAK vherever a 

construct must break a line or may be allowed 
to break a line',' 

The values of the attributes associated with CCftJBEGIN 
and BREAK are determined by the prettyprinting standard 
of the PL* 

The driver can thearefore be realized by modifying 
the parser of the input plT Since the parser of a PL 
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follows directly frcm its grammer, enciching the granmar 
to handle prettyprinting information automatically 
solves the problem* We therefore introduce three special 
types of non-terminals , - CONBEGIN, BREAK and CCSsFEr© - ^ 
fortified with their corresponding attribute values into the 
granuTvar « ihe syntactic Implications of these non-terminals 
being nnll^ each of them# in turn# map onto the null string,' 
However # the procedures corresponding to these ncai-terrainals 
construct and pass prettyprinting information to the pretty- 
printer',' Thus the entire prettyprinting information is 
represented in the grammar of the language instead of 
getting burried unobtnasively in the code';' 

Once the grammar is written# the parser— driver follows 
directly,' The process is very single for recursive descent 
parsers# which can drive the prettyprlnter directly, without 
using a parse tree';' Our prettyprlnter for P/kSCAh uses this 
approach ■*■ The parser validates the character tojcens of tte 
input program',' through the procedure ACCEPT m d sends 
them to the prettypr inter';' To transmit blanks between two 
lexemas, the lexical analyser makes direct calls to the 
prettyprinter";' The string tokens are assembled by the 
process “GENSTRING';' The TOKENS for "delimiters” are 
assembled and transmitted to the prettyprlnter by the 
procedures GENBREAK, GENBEGINCCSJ and GENENDCOw; Using a 
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parser aufccxitaticaiLy enables us to congress redundant 
blanks. However/ it also erases useful infoxmation 
like coiiinents.' For this, the procedure COMMENT in 
the lexical analyser is modified to generate its own 
string tokens and BREAK tokens at every blank in the 
Gcmmenf* However/ the structure of the comment has no 
relevance to the PL structure. So twe dimensicnal 
comments for possible pictorial representation may 
get completely blotched up by the prettyprinter'.' 

4-2. A DESCRIPTION OF IHE IMPLEMENTATIC»I 

A listing of the constant/ PL- independant procedures 
of the implemai tation is attached. To serve as examples 
of the prettyprinted output c£ our packa^ they have 
been prettyprinted using*. partly one program and partly/ 
another',' 

The inherent PL independence of the basic algorithm 
envied us to simultaneously develop two programs catering 
to the two different prettyprinting strategies chosen'. 

The parser-drivers of both programs were handcodeid/ and 
followed directly from the graimers/ which are attached. 

The success of our technique of first writing the 
pmttyprinter graronar/ whose correct formulation ensmred 
the correctness of the parser-driver "iAiich followed/ 
encouraged us to write a prettyprinter grammar for Module— 2 
taken frem the Modula-2 report [wiRT sq]’.' ^Ehe parser. 


v/ere 



pRerTypRiNTfiR awnm for moj)ula.2 


5 V 

galident 

mmm'^ 

t 2j 

Constoecl 


f 3) 

ConstExp 


C 43 

SlmpleConstExp 


C 5J 

ConstXertR 


r 63 

ConstFact 


r 7i 

set 



1 «] 

element 

W MW ^ 

C 9] 

TypeDeci 

- — > 

Clt3 

type 


|t23 

llil 

SimpleType 

enunern 

IdentLlst 

mrnm^ 

mmmy 

£153 

SubrangeType 

mrnmy 


Ident f idemt } 

" 0rBaic(O,6> ConstExp 

H^onano. 

ConbeqinCO, inconsistent) 

Con2nd ^ Break(0,2) SlipieConstExp 

Conbeglnto, inconsistent) l | j 
Conend^^*^* ^ A^JciOp BreakC0,2) Constfer® ) 

Conoegin(oanconsistent) 

^ ^«iOP »rea>cC0,2) ConstFact ) 

Conbegin CO. inconsistent) 
gallaent i number I strlno I set I 

• *^0'^ ConstFact ' 

Conend 

qalident 3 

Conbegin CO, inconsistent) 

*’{•* relement < Brea] 

Conend 
Conend 


'Breaic(0,2) element >3 •»>« 


Conoegin CO, inconsistent) 

^ *•" 8teaK:C0,2) ConstExp 3 

conend 

ConoeglnCO, Inconsistent) 
ident ereafcC0,6) type 
Coneno 




gai ident J enumern I Subrangetyp 
ConpeginCo, inconsistent) *C* Ide 


.pnbegln CO, inconsistent) 
ident { i)reaic(0,2) ident ) 

Conend 

ConbeglnCO, inconsistent) 

"C* Constfexp Brealc(0,2) ConstExp 
Conend 


pe 

entList Conend 


m 



ri5j 

ArrayType 


C17) 

RecordType 

mmrnm^ 

riBj 

FldLstSeq 


£191 

FldPst 

mmm^ 


idOi 

variant 

-«-> 

£21) 

Case£,abl,st 

-— > 

£223 

Castiiabeis 

mmm^ 

C?33 

Set Type 


£243 

Pointer ryr><® 

mmm'^ 

£253 

ProcType 

mm» y 

£263 

For-nTypeLst 

mmm'^ 


C27i 

VarDecl 


£283 

designator 

> 


sim^el^pe > 

von^ndi 

gECORD 

Conend FldLstSeq Break: (max, 0) EMD 

Conend*^^ < 8reaic(0,0) Fldbst h 

9?222if?9^24''S®?^slstent) 
rxIr^VH^ *.8 type { 

CASE {ident'*j*J qalident fiF Breaic(ma».rA«»f n^nnt' 

ConheoinCO,consJstenU ^reaKimax^caselndent, 

varaint { "|» BreaJc(0,0) variant > 

Conend iBreak C max, case Indent ) 

3 Break (max, 0) mo 

V 0 * 1 © flQ 

ConbeglnCO, inconsistent) 

Conend^^^"®^ * Break(0,6) FldLstSeq 
Conpeain(0, consistent) 

ConlSd^^®^® ^ Casebaoels > 

^oBoeglnCO, Inconsistent) 

«godstExp t •*.,• 8reak(0,2) ConstExp J 

inconsistent) SET OF SiwpleType 

^22223^''^® . POINTER TO type 

t«.Qn©TiO 

PROCEOURE £ For»Typel»st J 

vOnend 

ConoeglnCO, inconsistent) 

Conend Formfype I CVAR) ForfftTypeH*)" 

„£ •*:« Breaic(0,4) qalident J 

vonena 

ConbeqlnCO,inconsisttent> 
identEist *?" 8reakC),6) type 
Conend 

Conbeqinp* inconsistent) 
qalident i “ « Break£0,2) ident I 
„ . C" Explilst "J* I ) 

Conend 



C29j 

ExpLlst 

OOJ 

expression 

on 

SimpleExp 

E323 

term 

E33J 

factor 

f34j 

Actual Par 

f35J 

statement 

r363 

assign 

E373 

ProcCaii 

E,:383'' 

StmtSeq 

(39) 

IfStmt 












mmm^ 


mmmy. 


expression } 


Conend 

Conend ^ «eiOp Break:C0,2) SimpleExp 3 

”?^5f^l'^[Uif''^9nslstent) l •*♦•’]«-•* i 
Conend* BreaX(0,2) term 3 

CoS?nd ^ Brealc(0,2) factor } 

stent) 

„2(5'’2jpieisfl2’”j-”\AT'*flJ?S?'°’’‘'“'‘‘‘*‘'’"’ 

rftnni*rt< n fn - < J -t-l-JY* *■**'' '•*'* 




Conend 

^ ! IfStmt I CaseStit 

ConlJi'^®^®^ ’ Brealc(0,6) expression 

Conoeglnp, Inconsistent) 
designator f ActualPar 3 
Conend 

ConoeglnCO, inconsistent) 

^ ' 8realc(0,0) statement > 

ConoeglntO, consistent) 

Conbeoln (0 , Inconsistent ) 

BrealcCO, if indent) 

Conend 
{ Brea(cC0,0) 

Cgnbegin Co, Inconsistent) 

EiiSIf expression THEN 
BreateCO, If Indent) StmtSeq 
conend ) 
f BrealrCOrO) 

|BSE^BreaicCO, if Indent) stitseg 

BrealcCO, 0) END 
Conend 



f40 J 

Case, Stmt ■ : 

«•»> 

[41 J 

cssei.;::'; > /■ 


C42:! 

■ ’^Diieatmt 


143) ' 

Repeatst»t 

mmtm^ 

r44j 

'■ SSSSiS 



Hi"-"'' 

" PoopStmt 

— > 

[46J 

4ithStmt 

- — > 

[47] 

ProcDecl 


C48j 

^rocHeart 

•w *!* 'we ^ 

C49J 

block 



Inconsistent) 

8reafcCmax,0) EisjD 
Conend 

stent) 

Conend^^^®^ * 8rsaic{u,8) stmtse^i 
istent) 

Conend 

ConoecflnCu.conslstent) 

StmtSeq 

yi^ril, expression 

u c#-fi-©iici' ■■■■■■ 

ConoeglnCd, consistent) 

ConoeginCO, inconsistent) 

expression 

j,|j^|^jBteafc(0,0) ConstExp ) 

Conend i>o 

j,®JJ®®^fO:»*'®tindent) ■ stmtSeg BreateCO,©) EMp-' ■ 
Conoegln (Of consistent ) 

Conend ■ . : 

"njdegiftCo, consistent) 

Prealc(0,wltnindent) 

.Stmtseq dreaK(0,0 ) EnD 
Conend ■ '.■ 

Break (max *0) 

t„onbegin 0 . Inconsistent ) 

CDS?Sf*^r. 5 ;(S 5 ; 5 ? 55 “*''’"=‘''‘*"'‘' «»"' «*■>* 

"8SEA3lini2»l'2‘^®*^sistent3 
PROCEDURE ident C FornPar 3 
Conenct 

Conbegin CO, inconsistent) 

Conend 



rsoj 


rieciaratlon 




r53j 

t54] 


r55j 

tSfe] 


r573 


FPSectlon 

For^nType 

'‘io-'JulePecl 


priority 

export 


import 




*> 

•> 


.> 


mmmy 


ConSnd®"'' > 

«rfa>c(i»ax, 0 ) r 
^S5|S5|«iS/|coosislfnt) 
f*XnKa«?®f ' <i8cl Indent ) 
ConbeainC0» inconsistent) 
i ContoeglnCi), inconsistent) 
Conena SreaicCO^O) f 
Conend 



lypeoeci 


Cjnhe«S(o;iSc9nn5tent)’ 

Conaad: ■ ■ 

, 'frea!c((iiax>0) t 

" “'iJSSgfSi^Tg!’ *>■ 

«oduleDecl ”;■ 




r « - FPSection i 
I **s" qalident ) 
Conend 


?** BrealcCOa) FPSecn > ) 


inconsistent) 

cronend "* area(cC0,6) ForaType 

tftRPtAy OF) qalldent 


Conend 




{ import Breafetmax^O) , 

BreateCmax^O) ) blocfc ident 


Brea!c((nax# 0 ) 


Conend 
’’t" Inteaer *1" 
Conoeg|n|o inconsistent) 
_EXPngT C OUSbXriED 1 Br< 


Conend ' ^ ®rea)cC0,4) identliist «' 

stent) 

Conend ^ 8reaSc(0,43 Identbist 



rsej 


Cv'Jj 


[ 5.3 j 


tbl J 


fjef n^o-iuie 


fief inltlon 




?ro;i'’^ 0 'i'Ule 


CompiXiinit 


»**»•> 


oreak(fflax,«odulelndent 3 
f |*'®aiSi»^x,moduleindient) } 

Codend 

CO?jSf ttreaicCO,declindent) 
Conbeqiri(0,lncpn$istent) 

{ j^onoeginCO, inconsistent) 

C»niS 3 "^ Br«i.(O, 0 ) ) 

Conend 

IP& ^feffcCO^decltn-lent) 
Inconsistent) 

i »-onbeginC0, Inconsistent) 

Icient C "s" type J •*;“ 
area(c(0,0) > 

Conend 
Conend 

"onoegln Cp , Inconsl s tent ) 
jfAHBreaki Of deci indent ) 

i Conbegtnto, inconsistent) 


ConstOecl 


I 


^ ^ s (far %• .f * 

BreaicC 0 , 0 ) } 


Conend 
Conend 

Conend ,i 

Conoegtn'CO.inconsistent) t 
FrocHead ' 

Conend 

ConoeglnCO, inconsistent) 

MdDule ident c priority I *»?’• 

. Bcea*cf«ax,fflOduieindent) 
»reiic(max,moduleindent) 
olocK ident 
Conend 

Oefndoduie I 

C IMPLEMEMTAfiOfi • i ProoModule ", 


VarDeci "t" 



35 


with error re:;overy^ for Modula-2 being already available, 
v/e do not forsee any difficulty v^hatsoever, in writing the 
parser-driver. The same confidence has led us to explore 
the possibility of automatic generation of prettypr inters 
in the next chapter',' 

4—2 ;’l User Options; 

The user can exercise bhe following options when using 
our prettyprinter ; 

In all cases, default options are given in paren- 
thesis ; 

I'.' WIDTH OPTIONS : 

(i) Width of the output medium (124) 

(ii) Maximum possible width of the nonblank 
portion of any line (70) 

II'.‘ INDENTATION OPTIONS FOR: 

(i) Start of the output: This is to facilitate 
the prettyprinting of a nested procedirre in 
a large program. If the procedure has xjtnder- 
gone any changes, it own then be prettyprinted 
alone, and reinserted into the original 
program';' Thus • 

Start indent = indentation for a blank 36 


(level of the jarocedure-l ) 
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(ii) ocedure/f urr: tion/program block (6) 

(ill) Declarations (4) 

(iv) Records (5) 

(v) Conpound statement (3) 

(vi) If statement (2) 

(vix) Repeat statement (5) 

IIIV OTHER OPTICXJSs 

(i) Whether to insert new page inarks or to 
retain old ones.' 

(ii) Whether breaks after statements/declarations 
should be forced consistently after every 
statement or declaration or forced only when 
necessary 'a 

4~2','2 Shortcomings of the Implementation 
(i) The user has to estimate his own indentations to 
prevent the program from running off the right 
rr&rgin due to excessive indentation. Even this 
may not save the program if it has too many levels 
of nested procedures and/or control statements',' 
(ii) Large, multi-line comments may be broken up 
improperly',' 

(iii) Comments meant for any construct should be gi-yen 
before closing the construct, otherwise ttey may 
not be positioned properly with respect to the 
construct they are meant for'. 
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(iv) Insertion of "page marks'* may not be very 
meaningful due to the limited amount of 
look-ahead (cane line). 
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grairanar oE every PL 7 However# a closer examiiftion of the 
characterlst.t.cs of these “delimi'cers" and their attributes 
reveals a better solution# involving less sweeping chcnges. 

The "delimiters” and their attributes are Invariant 
for any PL7 The procedures representing tliem are also 
constant procedures like our basic algorithm. All tl^se 
symbols can therefore be considered as the terminal symbols 
of the metalanguage used to describe the granmar of any Ph. 
The transformation rules of the metalanguage can specify 
the semantics of tfese symbols as cal Is to the procedures 
represented by them." "niese modifications have been carried 
out on the metalanguage formulated by Datta [dATT 8l] to 
specify the syntax of LL(1) languages! The grammar describ- 
ing our metalanguage is attached as Fig! 5717 

5' TRANSFCRMATION RP^jES TO INTERPRET THE EXTENSIONS 
TO THE METALANGUAGE. 

The transformation rules for the ^metalanguage that we 
have adopted rena in the same except for our local extern ions'. 
We discuss these extensions in this section! 

Changes in the Lexical Analyser-Generator 

The generated lexical analyser should store every 
recognised let --erne and its length and make a call to a 
constant procedure GENSTRING vilch forms a token to be passed 
to the prettyprinter! The procedure to handle comments in 
the generated lexical analyser should also store the ccroment 
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and its length and make calls to oenstring and then the 
ptettypninter « Long Goimwants should be dvided into sepe— 
rate lines by positioning linebreaks through calls to 
GENBREAK between two prettyprinted strings of the caranent, 

5— 2 m2 Introduce a call to the prettyprinter routine 
in the constant procedure ACCEPT^ which validates a given 
lexone at any point of the parser' 

5-2 ';3 Introduction of a procedure* IDENTyMLIST, in the 
parser generator v/hich tabiolates the various indentations* 
their default values and their permissible raiges'.' 

IDENThYMLIST deposits every valid indent name and the 
corresponding values into an “indent -table When the entire 
list is processed* it is sorted on the indent names key'. 

Now whenever the value corresponding to an II®ENTSAME is 
required the routine does a binary search in the table and 
returns the default value corresponding to the INDENTNAl'-E'.' 

5-2,4 Now if all the indentations specified in 
INDENTSYMLIST are to be teated as constants* then only the 
procedinres CONBEGINFACT* CONBEGIN and BREAKFACT are required'. 
However, if some options are to be left to the user then two 
more procedures have to be added in the generator as given 


belov7i 
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^ ^ 5-2. Procedure M.VmDmT, *ioh generates an 
initialization procedure to declare the mDEinT,lBli; of 
required size in the generated program and produces an 


initialisation procedure to initialize the table according 
to the- procedure IKDENTLIST described before i 

5-214.-2 Procedure OPTIONS, which generates a procedure 
n the options exercised by the user, and rfiodify the 
indent table Initialized by MiPlNDENTl The syntax of the 
option list could be 


indentn^ne 1 - value 1; indent name 2 = value 2; 

The optlcns list must occupy the first line of the 
input -program to the prettyprinter. So when the- parser 


of the prettyprinter finds that the first symbol in the 
text is a it transfers control to options, which checks 
the value for permissible rage, scans the Indent t*le for 
the- given name, and modifies the associated value. Now 


IIJDENTSYMLIST should be modified 


to pass the index rather 


than the associated valued' 

S™enSa?orr procedures 

(i) CONBEGINP^CT: Processes a CDNEEGInfACT and generates 
a corresponding procedure call viz.’^ GENBEGINCON 
(INDENTTi\B[i]^ Selector val), where I is the index 
corresponding to the INDENTSYM 
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(ii) COI^NDPACTs Acce-nf*<5 -hKia 4- 

iccepts the terming **cafEND** 

and generates a call to the procedure 

GENEMXrCW. 

PrOGOS=?fiQ -t-K^ j 

t'rocesses the production corres- 
ponding to the name and generates a procedure 
call - GENERalK (lMDT--iB{i), indtab[jJ) where 
I and J are the indices corresponding to 
IHDENTSiMl and INDENTSlMa respectively 
5 3. Having derived the prettyprinter grammar and the 
parser generator for the same, we are now capable of 
generating the driver module of the prettyprinter for 
eny LL(1) language’.- The prettyprinter module, which is 
constant feature can then be combined with the genera- 
ted driver module to form the prettyprinter for the 
language';" 



CHAPTER 6 


CONCLUSIONS 

-As stated in the introduction/ v^e are '.interested in 
seeking a modularisation that cleanly divides a consti- 
tuent of a PE into PL— dependent, 03— dependent and appli- 
cation dependent parts';' This division greatly helps in 
locating those aspects of a PE that can be autcraaticaily 
generated',' The clarity with vhich we visualize inter- 
faces between these three aspects of a pE package can only 
help in reducing the overall systems programming burden',' 

Our view gains further credence frcxn the experience of the 
Sperry-Univac research team concerning their advanced 
interactive debugging system, the Universal Compiling 
system and the General Syntax /malyser [su 79], 

In the application we have chosen, the Operating System 
plays no role',' We only have the PL-dependent part and the 
prettyprinting routines - SCVN and PRINT'.' The former part 
consists of a pjarser that generates this information and 
invokes the prettyprinting routines,' 

The above delineation is entirely Oppen’s [OPPE 80 ]; 
we have only adapted his solution by our local experience. 
Nevertheless, we feel that this style of construction of 
PEs is a clear pointer for future direction. We had attemp- 
ted a similar approach to the development of interactive 
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aebugging fa .iiities, a discussion on which is not 
Eresented In this report^ Though this work did not 
have the sa„^ depth as in the case of prettyprinters, 
we were able to discriminate between the various 


aspects of PE modularisation'. This leads us 
that the ^proach is sound and merits further 
other applications as well.' 


to believe 

use for 
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